Videcoconferencing method and system for connecting a host with a plurality of participants

ABSTRACT

An improved videoconferencing system and method are provided for connecting a host to a plurality of participants. Unlike other videoconferencing methods and systems, the videoconferencing system and method of these preferred embodiments allow each of the parties of a videoconference to benefit from the interaction between the host and the selected participant without limiting the number of participants that can join the videoconference. The host and participants can interact with the videoconferencing system by issuing voice commands or DTMF signals to a voice response unit or by using a computer to access the videoconferencing system directly or indirectly though a network such as the World Wide Web.

RELATED APPLICATIONS

The present patent document is a continuation of U.S. patent application Ser. No. 11/183,716 (still pending), filed Jul. 18, 2005, now U.S. Pat. No. 7,369,153 which is a continuation of U.S. patent application Ser. No. 10/911,206 (now U.S. Pat. No. 6,947,068), filed Aug. 3, 2004, which is a continuation of U.S. patent application Ser. No. 10/175,511 (now U.S. Pat. No. 6,801,241), filed Jun. 18, 2002, which is a continuation of U.S. patent application Ser. No. 09/222,592 (now U.S. Pat. No. 6,433,813), filed Dec. 28, 1998, the entirety of each of which is hereby incorporated by reference.

TECHNICAL FIELD

The present invention relates generally to videoconferencing services in which video and audio signals of at least one of the parties of a videoconference are transmitted to and received by at least one other party of the videoconference

BACKGROUND

In many videoconferences, one of the parties is the “host,” who is typically the center or focus of the videoconference, and the other parties are the “participants.” For example, the host can be a teacher located in one classroom, and the participants can be students located in a remote classroom. Although there are several techniques that can be used to connect the host with the participants, many of these techniques limit the type of host-participant interaction and/or the number of parties that can participate in the videoconference.

In one videoconferencing technique, the video and audio signals of the host are broadcast to each of the participants. This technique finds particular application in mass broadcasting, such as when a host is delivering a lecture to a large number of participants, because there is virtually no limit on the number of parties that can participate in the videoconference. Since the host does not receive video and audio signals from any of the participants, one disadvantage associated with this technique is that there is no interaction between the host and the participants. In a modified version of this technique, the host can receive video and audio signals from a selected participant. Although this modified technique allows the host to interact with one participant, the other participants to the videoconference do not receive the video and audio signals of the selected participant and are, therefore, excluded from the interaction.

To address this problem, another videoconferencing technique has been developed in which the video and audio signals of the host and each of the participants are combined and presented to each of the parties of the videoconference. For example, in a videoconference with a host and three participants, the image transmitted to each of the parties can be a four-quadrant image with the video signal of each participant shown in a respective quadrant. While this videoconferencing technique allows each party of the videoconference to interact with one another, the number of parties that can participate in the videoconference is limited to the number of parties that can be displayed in the transmitted image (four in the example given above).

There is, therefore, a need for an improved videoconferencing system and method for connecting a host with a plurality of participants.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is an illustration of a video terminal and camera that can be used with a presently preferred embodiment of this invention.

FIG. 2 is a block diagram of a videoconferencing system of a presently preferred embodiment.

FIG. 3 is a block diagram showing a preferred implementation of a videoconferencing unit of the preferred videoconferencing system of FIG. 2.

FIG. 4 is a flow chart of a preferred method for connecting a host with a plurality of participants.

FIG. 5 is an illustration of video images received by three participants in a videoconference of a preferred embodiment.

FIG. 6 is a flow chart of a preferred method for interacting with a videoconferencing control unit to add a participant to a videoconferencing queue.

FIG. 7 is a flow chart of a preferred method for interacting with a videoconferencing control unit to select a participant from a videoconferencing queue.

FIG. 8 is an illustration of video images received by a host and three participants in a videoconference of a preferred embodiment in which the host is interacting with the second participant.

FIG. 9 is an illustration of video images received by a host and three participants in a videoconference of a preferred embodiment in which the host is interacting with the third participant.

FIG. 10 is an illustration of video images received by a host and three participants in a videoconference of a preferred embodiment in which the host is interacting with the first participant.

DETAILED DESCRIPTION OF THE DRAWINGS

By way of introduction, the preferred embodiments described below include a videoconferencing system and method for connecting a host to a plurality of participants. Each of the parties of the videoconference has a video terminal 100 to receive and present video and audio signals from one or more of the other parties and has a camera 110 to transmit video and audio signals to one or more of the other parties (FIG. 1). Unlike the videoconferencing techniques described in the Background section, the videoconferencing system and method described below allow each of the parties of a videoconference to benefit from the interaction between the host and the selected participant without limiting the number of participants that can join the videoconference. The host and participants can interact with the videoconferencing system by issuing voice commands or DTMF signals to a voice response unit or by using a computer to access the videoconferencing system directly or indirectly though a network such as the World Wide Web.

Turning again to the drawings, FIG. 2 is an illustration of a videoconferencing network 200 of a presently preferred embodiment. Preferably, the videoconferencing network 200 is implemented as a public, switched, broadband video network operative to provide a full-motion videoconference. As shown in FIG. 2, this network 200 comprises video switches 210, 220, 230, 240 associated with a host and three participants (participant 1, participant 2, and participant 3), each with their own video terminal and camera for receiving and transmitting video and audio signals to the respective switches. While FIG. 2 shows each video switch being associated with a single party, a single video switch can be associated with two or more parties.

As shown by the solid connecting lines in FIG. 2, the video switches 210, 220, 230, 240 are coupled with a tandem video switch 250 to transfer video and audio signals. As used herein, the term “coupled with” means directly coupled with or indirectly coupled with through one or more components. The tandem video switch 250 transfers video and audio signals among the video switches 210, 220, 230, 240 and can also transfer audio and video signals to and from other video networks. Some or all of the components shown in FIG. 2 can be located in these other video networks. Preferably, fiber optic cabling is used to transfer the video and audio signals. Suitable video switches and tandem video switches can be purchased from Ditek.

A control unit 270 is coupled with the video switches 210, 220, 230, 240 and the tandem video switch 250 and provides these switches with control signals (shown by dashed lines). The control unit 270 comprises a controller 272 and also comprises a voice response unit 274 and modem 276 to connect the control unit 270 to an external telephone or computer, respectively. The control unit 270 directs the tandem video switch 250 to provide the videoconferencing unit 260 with video and audio signals from one or more of the parties. As shown in FIG. 3, the videoconferencing unit 260 comprises a first and second diplexer 310, 320 and a first and second video/audio processor 330, 340.

To schedule a videoconference, information identifying the host and the participants, as well as the time of the videoconference, is entered into the control unit 270. Preferably, the videoconference is scheduled by using a computer via the modem 276 of the control unit 270, although the VRU 274 of the control unit 270 can also be used. After the information is entered, the control unit 270 reserves the necessary components in the system 200 for the videoconference. At this time, the controller 272 creates a queue for listing participants who request to interact with the host, as described below.

FIG. 4 is a flow chart of a method of a preferred embodiment for connecting a host with a plurality of participants using the videoconferencing network of FIG. 2. As shown in FIG. 4, this method comprises transmitting audio and video signals of a host to a plurality of participants (410), adding at least one participant into a queue (420), selecting a participant from the queue (430), transmitting audio and video signals of the host to the selected participant (440), transmitting audio and video signals of the selected participant to the host (450), combining the audio and video signals of the selected participant with the audio and video signals of the host (460), and transmitting the combined audio and video signals to the participants who were not selected (470). This method will be described in detail below.

First, at the start of the videoconference, the control unit 270 directs the tandem video switch 250 and videoconferencing unit 260 to transmit video and audio signals of the host from video switch 210 to participants 1, 2, and 3 at video switches 220, 230, 240 (410). The video images transmitted to these participants are shown in FIG. 5. If a participant has a question or a response to the host's presentation, for example, the participant can contact the control unit 270 and request a connection with the host. For example, using a telephone, a participant can contact the VRU 274 of the control unit 270 and issue voice commands or DTMF signals to request a connection with the host. Alternatively, the participant can establish a computer connection to the control unit 270 via the modem 276 either directly or indirectly through a computer network such as the World Wide Web.

FIG. 6 is a flow chart of a preferred method for interacting with the control unit 270 using the VRU 274. When a participant connects to the VRU 274, the controller 272 determines if the queue has been activated by the host (610). The host may keep the queue inactive, for example, during the lecture phase of a presentation and may activate the queue during the question-and-answer phase of a presentation. If the queue is inactive, the participant is informed that the queue is inactive, and the call is terminated (620). If the queue is active, the controller 272 next determines if the participant is currently in the queue (630). If the participant is not in the queue, the controller 272 adds the participant into the queue (420), (640) If the participant is already in the queue, the controller 272 removes the participant from the queue (650). Preferably, participants are added into the queue in a first-in-first-out (“FIFO”) manner.

Once participants have been added into the queue, the host can select a participant from the queue (430). FIG. 7 is a flow chart of a preferred method for interacting with the control unit 270 to select a participant from the queue. As shown in FIG. 7, when the host connects to the control unit 270, the controller 272 determines if there are any participants in the queue (710) and provides the host with an indication of the presence or absence of participants. For example, if there are participants in the queue, the controller 272 can instruct the VRU 274 to play a ringing sound to the host (720). If there are no participants in the queue, the controller 272 can instruct the VRU 274 to play a short beep every few seconds (730). Next, the controller 272 provides the host with a listing of options (740). In this preferred embodiment, the options include: (1) viewing the next participant, (2) returning to broadcast session, (3) viewing the previous participant, (4) viewing a particular participant, (5) changing the size of the queue, (6) activating/deactivating the queue, (7) clearing the queue, (8) toggling the PIP/PUP display, and (9) turning off/on the audio prompting listing the preceding options. The host can then select an option, such as (1) or (4) above, that connects him to one of the participants.

As described above, the video and audio signals of each party of the videoconference is sent from the tandem switch 250 to the videoconferencing unit 260. When a participant is selected, the control unit 270 instructs the videoconferencing unit 260 to return unaltered video and audio signals of the host and selected participant to the tandem video switch 250. The control unit 270 then instructs the tandem video switch 250 to transmit the video and audio signal of the host to the selected participant (440) and the video and audio signals of the selected participant to the host (450). In this way, the host and the selected participant can interact as if the videoconference were merely a point-to-point videoconference between the host and the selected participant.

The control unit 270 also instructs the videoconferencing unit 260 to combine the video and audio signals of the host and the selected participant (460). With reference again to FIG. 3, the video and audio signals of the host and the video and audio signals of the selected participant are received by the first and second diplexer 310, 320, respectively, which separates the video and audio signals for the host and the selected participant and sends the separated signals to the first video/audio processor 330. In this preferred embodiment, the first video/audio processor 330 combines the audio signals of the host and the selected participant and returns the combined audio signals to the first diplexer 310.

The first video/audio processor 330 also alters the video signals of the host and the selected participant and sends them to the second video/audio processor 340 for combination. The video signals can be combined as a picture-in-picture or picture-under-picture, or in any other suitable manner such as having the host appear on the left side of the display and the selected participant appear on the right side of the display. In operation, the first video/audio processor 330 scales the video images appropriately, and the second video/audio processor 340 combines the scaled images. For example, to combine the host and selected participant in a picture-in-picture fashion with the host in the background and selected participant in a smaller area in the foreground, the first video/audio processor 330 creates the smaller image of the selected participant, and the second video/audio processor 340 overlays the smaller image on top of the larger image of the host. The combined video signals are then returned to the first diplexer 310, which merges the combined audio signals from the first video/audio processor 330 and the combined video signals from the second videolaudio processor 340. The combined signals are returned to the tandem video switch 250, which is instructed by the control unit 270 to transmit the combined audio and video signals to the participants who were not selected by the host (470).

The operation of this method will be described in conjunction with FIGS. 8-10. Assume that participants 2, 3, and 1, in that order, each request a connection to the host and are entered into the queue. When the host selects the first participant (participant 2) in the queue, the host and participant 2 will each receive the video and audio signals of the other, as shown in FIG. 8. If the videoconferencing unit 260 is instructed by the control unit 270 to combine the host and participant 2 with the host in the background and participant 2 in a smaller area in the foreground (i.e., picture-in-picture), the tandem video switch 250 sends the picture-in-picture combination of the host and participant 2 to participants 1 and 3, as shown in FIG. 8. Because the interaction between the host and the selected participant is broadcast to the other parties of the videoconference, each party can benefit from the interaction, unlike previous videoconferencing techniques. Additionally, because the video images of only the host and the selected participant are combined, the number of parties who can participate in the videoconference is not limited as in some videoconferencing techniques that require each of the parties of the videoconference to be displayed on a single screen.

An additional advantage associated with these preferred embodiments is that the host can easily navigate among the participants in the queue. For example, if participant 2 has been selected, the host can select the next participant in the queue, causing the host and participant 3 to each receive the video and audio signals of the other and causing participants 1 and 2 to receive the combined video and audio of the host and participant 3, as shown in FIG. 9. If the host then selects the previous participant in the queue, each party will receive the signals shown in FIG. 8. If the host instead selects the next participant, participant 1 is selected, as shown in FIG. 10. In FIG. 10, the combination technique used is a picture-under-picture instead of a picture-in-picture. The host can also return to broadcast mode, as shown in FIG. 5, or can select a specific participant. For example, the host can return to participant 2, as shown in FIG. 8.

There are several alternatives to the preferred embodiments described above. For example, instead of receiving and transmitting video and audio signals of each party of the videoconference, the videoconferencing unit 260 can process only those video and audio signals that are to be combined, and the video tandem switch 250 can handle the other signals. Additionally, while the above preferred embodiments have been described in conjunction with a queue, it is important to note that the host can select any participant to the video conference even if the participant is not in the queue or if the queue is inactive or not created.

For simplicity, the term “controller” is meant to broadly refer to hardware and/or software that can be used to perform the functions described above with respect to the control unit 270. For example, the controller 272 can comprise a CPU to execute a software program held in the memory of the controller 272. Of course, any appropriate software language and any appropriate hardware, analog or digital, now in existence or later developed, can be used. Additionally, the functionality described above can be implemented exclusively with hardware, or a computer-usable medium having computer readable program code embodied therein can be used. It is preferred that the controller be a RS 6000 computer.

It is intended that the foregoing detailed description be understood as an illustration of selected forms that the invention can take and not as a definition of the invention. It is only the following claims, including all equivalents, that are intended to define the scope of this invention. 

1. A method comprising: receiving, by a first party device, a request from a second party device; transmitting, by the first party device, video of the first party device to the second party device; receiving, by the first party device, video from the second party device, wherein while receiving video from the second party device, the first party device does not receive video from any of a plurality of party devices other than the second party device; and transmitting, by the first party device, combined video of the first party device and the second party device to the plurality of party devices other than the second party device.
 2. The method of claim 1, wherein the second party device does not receive video from other plurality of party devices.
 3. The method of claim 1, wherein the second party device does not receive its own video.
 4. The method of claim 1, wherein the first party device does not receive its own video.
 5. The method of claim 1, wherein the request from the second party device is received via the World Wide Web.
 6. The method of claim 1, comprising: transmitting, by the first party device, audio of the first party device to the second party device; receiving, by the first party device, audio of the second party device, wherein the first party device does not receive audio from the plurality of party devices other than the second party device; and transmitting, by the first party device, combined audio of the first and second parties to the parties of the plurality of parties other than the second party.
 7. The method of claim 6, wherein the second party device does not receive audio from other party devices of the plurality of party devices.
 8. The method of claim 1, wherein the plurality of party devices comprises a third party device, and wherein the method comprises: receiving, by the first party device, a second request from the third party device; transmitting, by the first party device, video of the first party device to the third party device; receiving, by the first party device, video of the third party device, wherein while receiving video from the third party device, the first party device does not receive video from the plurality of party devices other than the third party device; and transmitting, by the first party device, combined video of the first party device and third party device to the plurality of party device other than the third party device.
 9. The method of claim 1, further comprising: adding, by the first party device, the second party device to a queue in response to receiving the request from the second party device; and selecting, by the first party device, the second party device from the queue; wherein the acts of transmitting video of the first party device to the second party device, receiving video of the second party device at the first party device, and transmitting combined video of the first party device and the second party device are performed in response to selecting the second party device from the queue.
 10. A non-transitory computer-readable storage medium comprising computer instructions, when executed by at least one processor, cause the at least one processor to perform operations comprising: receiving a request from a second party device; transmitting video of a first party device to the second party device; receiving video of the second party device at the first party device, wherein while receiving video of the second party device, the first party device does not receive video from a plurality of devices other than the second party device; and transmitting combined video of the first and second party devices to the plurality of devices other than the second party device.
 11. The non-transitory computer-readable storage medium of claim 10, wherein the request from the second party device is received over the World Wide Web.
 12. The non-transitory computer-readable storage medium of claim 10, comprising computer instructions which when executed by the at least one processor cause the at least one processor to perform operations comprising: adding the second party device to a queue in response to receiving the request from the second party device; and selecting the second party device from the queue; wherein the acts of transmitting video of the first party to the second party device, receiving video of the second party device at the first party device, and transmitting combined video of the first and second devices is performed in response to selecting the second party device from the queue.
 13. The non-transitory computer-readable storage medium of claim 10, comprising computer instructions which when executed by the at least one processor cause the at least one processor to perform operations comprising: receiving a second request from a third party device; transmitting video of the first party device to the third party device; receiving video of the third party device at the first party device, wherein while receiving video of the third party device, the first party device does not receive video from the plurality of devices other than the third party device; and transmitting combined video of the first and third devices the plurality of devices other than the third party device.
 14. A system comprising: a network element coupled to a network coupling a first party device with a plurality of party devices comprising a second party device, the network element comprising: a memory storing computer instructions; and a processor coupled to the memory, wherein the processor, responsive to executing the comprising instructions, performs operations comprising: receiving a request from the second party device; transmitting video of the first party device to the second party device; wherein while receiving video of the second party device, the first party does not receive video from the plurality of party devices other than the second party device; and transmitting combined video of the first and second party devices to the plurality of party devices other than the second party device.
 15. The system of claim 14, wherein the request from the second party device is received over the World Wide Web.
 16. The system of claim 14, wherein the executing the computer instructions, performs operations comprising: adding the second party to a queue in response to receiving the request from the second party device; and selecting the second party device from the queue; wherein the acts of transmitting video of the first party device to the second party device, receiving video of the second party device at the first party device, and transmitting combined video of the first and second party devices are performed in response to selecting the second party device from the queue.
 17. The system of claim 14, wherein the processor, responsive to executing the computer instructions, performs operations comprising: receiving a second request from a third party device; transmitting video of the first party device to the third party device; receiving video of the third party device at the first party device, wherein while receiving video of the third party device, the first party device does not receive video from the plurality of party devices other than the third party device; and transmitting combined video of the first and third party devices to the plurality of party devices other than the third party device. 